Articles
Reading Between the Pixels: Failure Modes in Vision Language Models
6 min read
This post is Part 2 of a two-part series on multimodal typographic attacks. In Part 1 of “Reading Between the Pixels,” we demonstrated that text–image embedding distance correlates with typographic prompt injection success: conditions that push...
Reading Between the Pixels: Assessing Prompt Injection Attack Success in Images
6 min read
This post is Part 1 of a two-part series on multimodal typographic attacks. This post was written in collaboration between Ravi Balakrishnan, Amy Chang, Sanket Mendapara, and Ankit Garg. Modern generative AI models and agents increasingly treat...